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Abstract. We consider the exit problem for small white noise perturbation 
of a smooth dynamical system on the plane in the neighborhood of a hyper- 
bolic critical point. We show that if the distribution of the initial condition 
has a scaling limit then the exit distribution and exit time also have a joint 
scaling limit as the noise intensity goes to zero. The limiting law is computed 
explicitly. The result completes the theory of noisy heteroclinic networks in 
two dimensions. The analysis is based on normal forms theory. 



1. Introduction 

Small stochastic perturbations of continuous deterministic dynamical systems 
have been studied intensively for several decades. One of the greatest achievements 
in the area is the celebrated Freidlin-Wentzell (FW) theory that allows to explain 
long-term behavior of systems with several meta-stable states at the level of large 
deviation estimates [5J. 

An interesting situation where one can prove more precise estimates than those 
provided via FW quasi-potential approach was considered by Kifer [5]. He stud- 
ied the exit problem for small noise perturbations of a deterministic system in a 
neighborhood of a hyperbolic fixed point (or, saddle) in R d assuming that the start- 
ing point for the diffusion belongs to the stable manifold of the fixed point. Kifer 
showed that as the noise level e decays to 0, the diffusion tends to exit along the 
invariant manifold associated to the leading eigenvalue A+ of the linearization of 
the system even in the presence of other unstable directions. He also found that 
the random exit time r e is asymptotic in probability to A7 lne -1 . 

When studying noisy perturbations of systems with heteroclinic networks, i.e., 
multiple saddle points connected by heteroclinic orbits, Bakhtin [2], [I], realized 
that to understand the vanishing noise behavior of the system, one has to extend 
Kifer's work and analyze (i) the limiting distribution of the approximation error r e — 
In e^ 1 ; (ii) the limiting scaling laws of the exit distribution for the neighborhood 
of each saddle. In fact, the exit distribution for the first saddle point serves as the 
entrance distribution for the next saddle point, so that the peculiarities of the exit 
distribution can significantly influence the further evolution of the system. 

The detailed analysis of scaling limits for distributional Poincare maps near sad- 
dle points carried out in [5] resulted in a complete theory for noisy heteroclinic 
networks. This theory explains interesting non-Markovian limit effects and the 
emerging patterns in the winnerless competion in the process of sequential deci- 
sion making (here, we are using the terminology from where applications of 
heteroclinic networks to neural dynamics are considered). The main result is that 
under the logarithmic time scaling the diffusion process converges in distribution 
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in a special topology to a precisely described limiting process that jumps between 
the saddles along the heteroclinic connections. 

The core result that was applied in [2] iteratively for sequences of saddle points 
connected to one another, is a lemma that computes the asymptotic scaling of 
the exit distribution for a neighborhood of a saddle point given the scaling of the 
entrance distribution. The proof of that lemma was based on a coordinate change 
conjugating the driving drift vector field to a linear vector field. Although this 
method and the lemma based on it apply in a fairly generic situation where the so 
called no-resonance condition holds, there are interesting cases such as Hamiltonian 
dynamics where the smooth linearization is not possible due to resonances. In 
these cases, the system remains nonlinear even under the optimal smooth change 
of coordinates, but it has a certain special structure that can be studied using the 
classical theory of normal forms (see, e.g., [5] . [7] . [10] ) . 

In this paper, we extend the key lemma of [5] to cover the resonant cases and, in 
fact, to the complete generality in the case d = 2. Our approach is based on normal 
forms that have particularly nice structure in the 2-dimensional case. We believe 
that the main result of the present paper can be extended to higher dimensions. 

An important consequence of our result is that in 2 dimensions the no-resonance 
restriction is completely removed from the theory of noisy heteroclinic networks 
developed in [2J, so that the theory applies to any heteroclinic networks generated 
by smooth vector fields on the plane. It also provides a generalization of [3] and [9] 
in 2 dimensions. 

The structure of the paper is the following. In Section[2]we introduce the setting. 
In Section [3] we state the main theorem and split the proof into several parts. In 
Section U we introduce a simplifying change of coordinates in a small neigborhood 
of the saddle point. The analysis of the transformed process in Section [5] is based 
upon two results. Their proofs are given in Sections |6] and [3 

Acknowledgments. The authors are grateful to Vadim Kaloshin for his advice 
on normal forms and for pointing to 7 . The work of Yuri Bakhtin is supported by 
NSF through a CAREER grant DMS-0742424. 

2. Setting 

Let us consider a C°°-smooth vector field b on R 2 and a C 2 -smooth matrix valued 
function a : M 2 —> M 2x2 . Let W be a standard 2-dimensional Wiener process. In 
order to ensure that the stochastic Ito equation 

(1) dX £ = b(X e )dt + ea(X e )dW 

has a unique global strong solution, our first assumption is that both b and a arc 
Lipschitz and bounded, i.e., there is a constant L > such that 

\a(x)-a(y)\V\b(x)-b(y)\<L\x-y\, i.jeK 2 , 
\a(x)\ V |6(a;)| < L, x e R 2 , 

where | • | denotes the Euclidean norm for vectors and Hilbert-Schmidt norm for 
matrices. These conditions can be weakened, but we prefer this setting to avoid 
multiple localization procedures throughout the text. For a general background on 
stochastic differential equations see, for example, [8]. 
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We shall denote by S — (S*)teR the flow generated by b: 

— S*x = biS'x), S°x = x. 

Let V be a domain in R 2 with piecewise C 2 boundary. We assume that the 
origin belongs to V and it is a unique fixed point for S in V, or, equivalently, a 
unique critical point for b in V. Therefore, 

b(x) = Ax + Q(x), 

where A = Db(0) and Q is the non-linear part of the vector field satisfying |Q(x)| = 
0{\x\ 2 ), x^O. 

We assume that is a hyperbolic critical point, i.e. the matrix A has two eigen- 
values A+ and —A- satisfiying — A_ < < A+. Without loss of generality, we 
suppose that the canonical vectors are the eigenvectors for the matrix, so that 
A = diag(A+,-A_). 

According to the Hadamard-Perron Theorem (see e.g. QI3 Section 2.7]), the 
curves W s and W" defined via 

W s ={i€l 2 : |fl*a:| -> as t -> oo} 

and 

W" = {i£i 2 : \S*x\ -> as t -> -oo} 
are smooth, invariant under S and tangent to ei and, respectively, to ei at 0. The 
curve W s is called the stable manifold of 0, and W" is called the unstable manifold 
of 0. 

We assume that W" intersects dV transversally at points q+ and q_ such that 
the segment of W" connecting q_ and g+ lies entirely inside V and contains 0. 
We fix a point xo € W s D V and equip (H} with the initial condition 

(2) X e (0) = x + e a ^, e>0, 

where a G (0, 1] is fixed, and (£e) £ >o is a family of random vectors independent of 
W, such that for some random vector £ , £ e — > ^ as e — >• in distribution. 
If a 7^ 1, then we impose a further technical condition 

(3) P{£o || b(x )} = 0, 

where || denotes collinearity of two vectors. 

We are studying the exit problem for the diffusion process X e in V. We are 
interested in the asymptotic distribution of the random point of exit of X c from V 
given by X e (rY), where is the stopping time defined by 

tY = tY(x ) = in£{t > : X e (t) g dV}. 

3. Main Result. 

The main result of the present paper is the following: 

Theorem 1. In the setting described above, there is a family of random vectors 
(0e)e>O; a family of random variables (ip e ) e >0) an d a number 

{ 1, aA_ > A, 

such that 
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The random vector 

converges in distribution as e — > 0. 

The distribution of ib t ,4> t , and the distributional limit of e will be described 
precisely. 

The proof of Theorem [1] has essentially three parts involving the analysis of 
diffusion (i) along W s ; (ii) in a small neighborhood of the origin; (hi) along W u . 

The first part is based on a Theorem borrowed from [2 Lemma 9.2]. To state 
the theorem, we need to introduce Q x (t) as the linearization of S along the orbit 
of 2 GR 2 , i.e. we define $> x (t) to be the solution to the matrix ODE 

-^U x (i) = A(t)$ x (t), $ x (0) = 7, 
at 

where A{t) = Db(S t x). The theorem reads as: 

Theorem 2. Let x £ M. 2 and (£,e) e>0 be a family of random vectors independent of 
W and convergent in distribution, as e — > 0, to £o- Suppose a £ (0, 1] and let X e 
be the solution of the SDE |T]) with initial condition X e (0) = x + e Q £ e - Then, for 
every T > 0, the following representation holds true: 

X e (T) = S T x + e a ^, e>0, 

where 

r Law ~z ~ 

& — > Co, e 0, 

with 

N being a Gaussian vector: 

N = $ X (T) f ^ x (s)- 1 a{S s x)dW{s). 
Jo 

If a = 1 or assumption (J3)) holds, then P{£o II o(5' T a;)} = 0. 

The second part of the analysis is the core of the paper. Theorem [3] below 
describes the behavior of the process in a small neighborhood U of the origin. 
Notice that since xq £ W s , one can choose T large enough to ensure that that 
S T xo £ W s (~1 U. Therefore, the conditions of the following result are met if we use 
the terminal distribution of Theorem[2] (applied to the initial data given by ©) as 
the initial distribution. 

Theorem 3. There are two neighborhoods of the origin U C V C V , two positive 
numbers S < 5' , and C 2 diffeomorphism f : U' — > (— S',S') 2 , such that f(U) = 
(—S,S) 2 and the following property holds: 

Suppose x £ W s n U, and (£e)e>o is a family of random variables independent 
of W and convergent in distribution, as e — > 0, to ^q, where £ satisfies ^ with 
respect to x. Assume that a £ (0, 1] and that X e solves ([T]) with initial condition 

(5) X e (0)=x + e%, 

where £ e satisfies condition with respect to x. 
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There is also a family of random vectors (4>' e ) e >0, and a family of random vari- 
ables (f/>e)e>0) such that 

X e (rf)= 3 (sgn(^ei)+eV e , 
where g = f^ 1 , (3 is defined in (UJ), and the random vector 

converges in distribution as e — > 0. 

The notation for and its components is chosen to match the notation involved 
in the statement of Theorem[T] Random elements ip^fi'e an d the distributional limit 
of Oj will be described precisely, see (|2"T|) . Obviously, the symmetry or asymmetry 
in the limiting distribution of results in the symmetric or asymmetric choice of 
exit direction so that the exits in the positive and negative directions are equiprob- 
able or not. On the other hand, the limiting distribution of determining the 
asymptotics of the exit point can also be symmetric or asymmetric which results 
in the corresponding features of the random choice of the exit direction at the next 
saddle point visited by the diffusion. 

In Section [5] we prove Theorem using the approach based on normal forms. 

The last part of the analysis is devoted to the exit from V along W" . We need 
the following statement which is a specific case of the main result of pQ. 

Theorem 4. In the setting of Theorem^ assume additionally that (i) q — S T x G 
dV ; (ii) there is no t € [0,T) with S l x £ dV ; (Hi) b(q) is tranversal (i.e. not 
tangent) to dV at q. Then 

(6) tY 4 T, c -► 0, 
and 

(7) e- a (X e ( T Y)-q) L -r^ Q , e^O, 

where 7r denotes the projection along b{q) onto the tangent line to dV at q. 

Now Theorem[T]follows from the consecutive application of Theorems[2]through|4] 
and with the help of the strong Markov property. In fact, in this chain of theorems, 
the conclusion of Theorem [2] ensures that the conditions of Theorem [3] hold, and 
the conclusion of the latter ensures that the conditions of Theorem |4] hold. Notice 
that the total time needed to exit V equals the sum of times described in the three 
theorems. Notice also that at each step we can compute the limiting initial and 
terminal distributions explicitly. Theorems [2] and [4] contain the respective formulas 
in their formulations, and the explicit limiting distribution for of Theorem [3] is 
computed in (|2"T1) . 

4. Simplifying change of coordinates 

In this section we start analyzing the diffusion in the neighborhood of the saddle 
point. The first step is to find a smooth coordinate change that would simplify the 
system. This can be done with the help of the theory of normal forms. 

Let j be a C°° diffeomorphism of a neighborhood of the origin with inverse /. 
When X t is close to the origin and belongs to the image of that neighborhood under 
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g, we can use Ito's formula to see that Y c — f(X e ) satisfies 

dY t = Df(X e )dX e + ±[Df(X £ ),X e ] 

= Df(g(Y e ))b(g(Y e ))dt + ea{Y e )dW + e 2 V(Y e )dt, 

for some smooth function "J : R 2 — > R 2 and a = ((Df)og)a. Here D denotes 
the Jacobian matrix, and the square brackets mean quadratic covariation. Since 
Df o g = (Dg)^ 1 , we can rewrite the above SDE as 

(8) dY e = ([DgiYt))- 1 b(g(Y £ )) + e 2 *(F £ )) dt + ed(Y e )dW. 

The idea now is to choose a transformation g (or, equivalently, /) that makes 
the drift in equation (JSJ) easy to estimate. We are going to use the normal form 
theory and so we need to recall certain terminology, notation and results from [7J 
putting them in the (two-dimensional) context of this paper. 

A pair of complex numbers A = (Aj., A2) is said to be non-resonant if there are 
no integral relations between them of the form Aj — a- A, where a — (cui, a-i) G 1? + 
is a multi-index with \a\ = a.\ + 012 > 2. Otherwise, we say that it is resonant. 
Moreover, a resonant A is said to be one-resonant if all the resonance relations for 
A follow from a single resonance relation. A monomial X Gj — X -y X 2 Gj IS called 
a resonant monomial of order R if a • A = Aj and |a| = R. Normal form theory 
asserts (see [7], [5]) that for any pair of integers R > 1 and k > 1, there are two 
neighborhoods of the origin fl f and fl g and a C fc -diffcomorphism / : £1/ — >• fl g with 
inverse g : fi s —> f2/ such that 

(9) (Dg(y)r 1 b(g(y)) = Ay + P(y)+K(y), y G Q g 

where P is a polynomial containing only resonant monomials of order at most R 
and 1Z(C) — 0(|C| i?+1 ). If A is non-resonant, then / can be chosen so that both 
P and 1Z in §§§ are identically zero. Moreover, due to [7J Theorem 3, Section 2], if 
A is one-resonant then / can be chosen so that 1Z in ([9]) is identically zero. More 
precisely, if A is a one-resonant pair, then for any pair of integers R > 1 and k > 1, 
there are two neighborhoods of the origin fl / and tt g and a C fc -diffeomorphism 
/ : flf — ¥ £l g with inverse g : Q g — > fi/ such that 

(10) (Dg(y)r 1 b(g(y)) = Ay + P(y), y€fl g , 

where P is a polynomial that contains only resonant monomials. 

Note that (A+, — A_) is either non-resonant or one-resonant (resonant cases that 
are not one-resonant are possible in higher dimensions where pairs of eigenvalues 
get replaced by vectors of eigenvalues). The non-resonant case (in any dimension) 
was studied in [2]. In this paper, we extend the analysis of [2] to the non-resonant 
case, i.e. the one-resonant case, given that we are working in 2 dimensions. 

To find all resonant monomials of a given order r > 2, we have to find all the 
integer solutions to the two 2x2 systems of equations: 

«iA + — 0:2^- = ±A±, 
a i + a 2 = r - 
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Therefore, the power multi-indices of a resonant monomial of order r has to coincide 
with one of the following: 

(11) K(r),a+(r)) = A+ 1 - (A + + rA_, (r - 1)A + ), 

(12) (ar(r),a 2 -(r)) = L_(( r - 1)A_, rA+ + A_), 

Let us make some elementary observations on integer solutions of these equations 
for r > 2. 

(1) None of the solution indices can be 0. Moreover, neither 0£j(r) nor o^~(r) 
can be equal to 1. 

(2) As functions of r, af(r) are increasing. 

(3) Expressions (fTTj) and (IT21 cannot be an integer for r = 2. 

(4) The term P = (P ll P 2 ) in JIDJ satisfies Px(y) = 0(yf|y 2 |) and P 2 (y) = 

This observation is a consequence of observations 1 and 3 since 
they imply that resonant multi-indices have to satisfy a + (r) > (2, 1) and 
oc~(r) > (1,2) coordinatewise. 

(5) If at least one of the coordinates y\ and y 2 is zero, then P(yi, y 2 ) = 0. This 
is a direct consequence of the previous observation. 

Given all these considerations, the main theorem of this section is a simple conse- 
quence of [7]. 

Theorem 5. In the setting described in Section^ there is a number 6' > 0, a 
neighborhood of the origin U' , and a C' 2 -diffeomorphism /:[/'—> (— o 7 , S') with 
inverse g : (— <5',<5') 2 — > U' such that the following property holds. 
If X e (0) <E U, then the stochastic process Y e — (Y 6j i,Y et2 ) given by 

Y e (t)=f(X e (tA T U)) 

satisfies the following system of SDEs up to : 

(13) dY e ,i = {X+Y eA +Hi{Y e ,e))dt + eai(Y e )dW 

(14) dY ta = {-\-Y t .2 + H 2 (Y £ , e)) dt + ed 2 {Y t )dW, 

where cr, : (—(5', S') 2 — > M ore C 1 functions for i = 1,2. XTie functions Hi are given 
by Hi = + e 2 ^, where : (— <5',<5') 2 — > R 2 are continuous bounded functions, 
and Hi : (—6', S') 2 x [0, oo) are polynomials, so that for some constant K\ > and 
for any y G (S',5 r ) 2 , 

\Hi(y)\ <^i|yil^|y 2 r 2+ , 
|ff 3 (i/)| <^iN ar M a2 ~- 

Here, the integer numbers of, i = 1,2, are smc/i i/iai (a^",aj) is of the form (jlip 
/or some choice of r — r\ > 3, and and (a^a^) is o/ i/ie /orm (|T2|) /or some 
choice r — r 2 > 3. in particular, 

\Hi(y,e)\ <K iy l\y 2 \+K 2 e 2 , 

\H 2 (y, e)| <^i|2/i|y 2 2 + ^ 2 e 2 , 

/or some constants K\ > and > 0. 
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5. Proof of Theorem [3] 

In this section we derive Theorem |3j from several auxiliary statements. Their 
proofs are postponed to later sections. 

Theorem [5] allows to work with process Y € = f(X e ) instead of X e while Y t stays 
m(-5',5') 2 

If we take 6 <E (0, d'), then for the initial conditions considered in Theorem[3jand 
given in ([5]), 

P{X e (0) £ U'} —> 1, e -> 0, 

i.e., 

p{y £ (o)e(-<5',5') 2 }^i, *->o. 

Moreover, denoting f(x) hy y — (0, y%) we can write 

y e (0) = y + € a X c = (e a Xe,i, V2 + e a x e , 2 ), e > 0, 

where Xe — (Xe.iiXe,2) is a random vector convergent in distribution to Xo = 
(Xo, 11X0,2) = Df(x)£ . Due to the hypothesis in Theorem [31 we notice that the 
distribution of xo.i has no atom at 0. 
Let us take any p £ (0,1) such that 

and define the following stopping time: 

f e =inf{t:|y £ ,i(t)|=e°*}. 

Up to time f e , the process X e mostly evolves along the stable manifold W s . 
After f e , it evolves mostly along the unstable manifold W". Process Y e evolves 
accordingly, along the images of W s and W" coinciding with the coordinate axes. 

Let us introduce random variables rjf- via 

% = e- tt(1 - p)A - /A+ y e , 2 (T e ). 
Also we define the distribution of random vector (t/q , t]q ) via 
(16) Vo = Xo,i + l {a =i}N + , 

Vo = \Vo\ X - /X+ V2, 



where 

(17) N+ = e- x - s a 1 (0,e- x - s y 2 )dW 

Jo 

is independent of Xo,i- 

Lemma 6. If the first inequality in (|15l) holds, then 

(18) P{r e4 (f e )-e Qp sgn 77+j^l, e -> 0. 
and 

(19) U+,?77,f e + ^-(l-p)logeJ ^> U/+,77(7,-^-log|r/+ 
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We prove this lemma in Section [6] Along with the strong Markov property, it 
allows to reduce the study of the evolution of Y e after f e to studying the solution 
of system (fT3"]) - (ri"4")) with initial condition 

(20) y £ (0) = (e a Psgn V t,e a ^ x -/ x +7 1 ;), 
where 

(21) (vt,V7)^(vt,Vo), ^0. 
We denote 

(22) r e = Te (S) = M{t > : \Y eA (t)\ = 6}. 

Our next goal is to describe the behavior of Y(t c ). To that end, we introduce a 
random variable 9 via 

' N, aA_ > A + , 

( 28 ) H L = { { 4 1 ) ^ A+ ^ + N, aA_ = A + , 

aA_ < A + . 

where the distribution of N conditioned on t)q, on {sgn^ = ±1} is centered 
Gaussian with variance 

r° 9 

<y±= e 2X ~ s |a 2 (±<5e A + s ,0)| ds. 

J — oo 

Let us also recall that /3 is defined in (U). 

Lemma 7. Consider the solution to system Q13p (1141) equipped with initial condi- 
tions (|20|) satisfying (|21[) . If the second inequality in (115|) holds, then 

(24) P{|y e)1 (r e )|=<5}^l, e^O, 

(25) r £ + ^log6 A ^ log ,5, 

A+ A+ 

(26) ^^(Te) ^ 0. 

Moreover, if < 1, then the convergence in probability also holds. 
A proof of this lemma is given in Section [7] 

Now Theorem [3] follows from Lemmas |5] and [7] In fact, the strong Markov 
property and (fT5)) imply 

P{rf = f e + t € (S)} 1, e^O, 

so that the asymptotics for rj 7 is defined by that of f e and t £ (<5). It is also clear 
that one can set ip'e = 17+, and £ = Dg(sgn(r]+)6ei)Y e (T e ), so that the limiting 
distribution of £ is given by 

/ 1 (5 

(27) D ff (s gn (77+)5 ei )(ee 2 ), ^ log ^+ 

where random variables t]q and 8 are defined in (|16l) and 
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6. Proof of Lemma [6] 

In this section we shall prove Lemma [6] using several auxiliary lemmas. We start 
with some terminology. 

Definition 1. Given a family (£ e ) e >o of random variables or random vectors and 
a function h : (0,oo) — > (0, oo) we say that £ e = O p (h(e)) if for some eo > 
distributions of {£,e/h(e)) 0<e<eg , form a tight family, i.e., for any S > there is a 
constant Kg > such that 

P{|£e| >K S h(e)} <8, 0<e<e o . 

Definition 2. A family of random variables or random vectors (£ e ) e >o is called 
slowly growing as e — > (or just slowly growing) if £ e = Op(e~ r ) for all r > 0. 



Our first lemma estimates the martingale component of the solution of SDEs ([13 
and (fT4l). Let us define 



S+(T) = sup 

t<T 

S~ (T) = sup 

t<T 



e- A + s a 1 (y e ( S ))^( S ) 

t 



T > 0, 



,-X-(t-s)~ 



a 2 {Y t {s))dW{s) 



T > 0. 



Lemma 8. Suppose (r e ) e> o is a family of stopping times (w.r.t. the natural filtra- 
tion of W). Then 

s+(0 = Op(i). 

If additionally (r £ ) £> o is slowly growing, then S~(r e ) is also slowly growing. 

Proof. Both estimates are elementary. The first one is an easy consequence of the 
martingale property of the stochastic integral involved in the definition of , and 
the BDG inequality (see [H Theorem 3.3.28]). As for the second one, we notice that 
the stochastic integral in the definition of S~ behaves essentially like an Ornstein- 
Uhlenbeck process, and similar bounds apply. □ 

Lemma 9. Suppose Y t is the solution of equations 113\ )- [14\ ) with initial conditions 
given by 

(28) y 6 ,i(0) = e a X e.i and F e , 2 (0) = y 2 + e a y e , 2 , 

where distributions of random variables (Xe,i)e>o an d (Xe,2)c>o form tight families. 
Let us fix any R > and denote l e — 77 A (— -y- log e + R) for e > 0. Then 



and the family 



is slowly growing. 



supe- xt \Y e , 1 (t)\ = P (e a ), 



e- a sup\Y t . 2 (t)-e- xt (y 2 + e a X e,2)\ 
t<i € 



£>0 



Proof. The tightness property implies that without loss of generality we can assume 
that \Xe,i\, |Xe,2 1 < C for some constant C > and every e > 0. 

Let us fix 7 > 0. We can use Lemma[5]to take c = 0(7/3) > such that 

P{S+(Z £ ) >C}< 7/2, 
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and 

P{S e -(0 >«r*}< 7/2, 

where q is an arbitrary number satisfying < q < a. Let us introduce a constant 
K = (3c) V C and stopping times 

P + = inf {t > : e~ x+t \Y eil (t)\ > 2Ke a ) , 

/9_ = inf {t > : |n,2(t) ~ e^- t (y 2 + e a X e, 2 )\ > 2Ke a - q } , 

We start with an estimate for Y €t \. Duhamel's principle for lfl3]l. Theorem [5] and 
Lemma [5] imply that the estimate 



supe- A + t \Y eA (t)\<e a K + K 1 / e- A + s F e>1 ( S ) 2 |Y e , 2 ( S )|d S + AT 2 — + eS+(/3) 

t</3 Jo A + 



(29) 



< e"^ + ifx / e- x + s Y eA {s) 2 \Y e , 2 (s)\ds + K 2 ^- + 

Jo A + «* 



holds with probability at least 1— 7/2. We analyze each term in the RHS of equation 

Let us start with the integral in (|29p. For s < f3, we see that 
n,i(*) a |i; i2 (*)| < 4K 2 £ 2 V A + S (|F e , 2 ( S ) - e - A - s (2/ 2 + e a X e,2)\ + e~ x - s \y 2 + e a X e, 2 \) 
< 8K 3 e Sa - q e 2X + s + AK 2 t 2a e [2X+ - x - )s {\y 2 \ + e a C). 

Therefore, 



r P 

K x / e- x + s Y e>1 (s) 2 \Y €)1 (s)\ds < 
Jo 



8K 3 K ie x + R 2 „ 



/■/? 

+ 4K 1 K 2 e 2a (\y 2 \+e a C) e {x +- x - )s ds 

Jo 

rP 

(30) < Xe Q /12 + 5is:i/Y 2 e 2a |y2| / e( A +- A -) s ds 

Jo 

for all e > small enough. Notice that this is a rough estimate, the constants on 
the r.h.s. are not optimal but sufficient for our purposes. This also applies to some 
other estimates in this proof. 

Let us estimate the integral on the r.h.s. of (f3"0")) . When A+ > A_, the integral 
is bounded by 

1 J\+-X-)B. 
1 3 (A+-A_)/3 < £ r a+aA_/A + . 



A+ - A_ - A + - A_ 

if A + < A_, then the integral on the r.h.s of (|30p is bounded by (A_ — A+) -1 ; if 
A + = A_, then the integral is bounded by 2aA^ 1 | loge|. Hence, for some constant 
-^a + ,a_ > and e > small enough, 

rP , 
K x \ e- x+s Y eA (s) 2 \Y^ 2 ( S )\ds <Ke a /12 + K x+ ^e 2a - a ^- x ^ x +^ |loge| 
Jo 

(31) < Ke a /6. 
Also, for e > small enough, 

(32) K 2 e 2 /X+ + eK/3 < Ke a /2. 
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From (|29|) . (f3Tj) and (|32|) we get that for all e > small enough, the event 



A = I supe- A +*|y £! i(i)| < 5Ke a /3 



is such that P(A) > 1 - 7/2. 

Let us now consider Y €f2 (t) and denote 



Z e (t) = y 6)2 (t) - e-^*^ + e Q Xe. 2 ). 

Duhamel's principle for 5^,2, the definition of /?, Theorem [5] and Lemma [S] imply 
that the inequalities 

sup|Z e (i)| <ifisup / e- x -^- s 1\Y eil ( S )\^\Y e . 2 ( S )\^d S + K 2 e 2 /\^+eS-(l3) 

t<0 t</3Jo 

< K x sup / e- x -^\Y eA (s)\^ \Y e , 3 (a)\°* ds 

t<p Jo 

+ e a - q (K 2 e 2 - a+q /\- + J- a+ *S-(J3)) 

(33) <2 a ie aa iK a iK 1 supe- x - t [ e (*-+<*i *+>\Y e 2 (s)\ a ^ ds + e a ~ q K/2 

t<p Jo 

hold with probability at least 1 — 7/2 and for all e > small enough. We analyze 
the integral term in (|33l) . Note that, from the definition of /?, and the inequality 
(a + b) r < 2 r ~ 1 (a r + b r ) we have that for any t < f3 and any e > small enough, 

\Y € , 2 (t)\ a - <2 a *- 1 Z e (t) a * +2 a *- 1 e- a * x - t \y 2 + e a X e,i\ a * 

< 2 2Q 2 - 1 J( a 2 e (. a -<l) a 2 -)- 2 2 ("2 -!) e - Q 2 A -*|y 2 |«J 

_|_ 2 2 ("2 - 1 ) e Qa 2" e - a 2 _A - t |T (e 2 | a 2 

< (a-g)22(«2 -1) (2K a i + e qa * \xe a| a **) + 2 2( - a z e~ a ? x ~ t \y 2 \ a 2 . 
Hence there is a constant -Ka > such that 

|^, 2 (t)r 2 ~ < e a i {a - q) K a + K a e- a * x -\ t < (3. 

Using the last inequality, the definition of /3, and the fact a^X+ — (a 2 — 1)A_ = 
from Theorem [5j we get 



aa i e ~ x - t / e (A - +Q ^ A + )s |r £ , 2 (s)r^s 



< c a(a, +a 2 ) c *+a, J ^" c _ | J<[ aC aa i / g( Q i A+-(a 2 -l)A_)s^ s 

A_ + a^\ + Jo 
K p^+ a i R 

(34) < e( Q -«) Q 2 __£! _ + K a e aa i p. 

A_ + a l A + 

Again, from Theorem[5]we know that > 1 and a 2 > 2 which together with (|34l 
imply that for all e > small enough 

(35) 2 a i e aa i K a i K 1 supe- x - t [ e {x - +a ^ x+)s \Y e 2 (s)\ a ^ ds < Ke a - q /6. 

t</3 Jo 



NORMAL FORMS APPROACH TO DIFFUSION NEAR HYPERBOLIC EQUILIBRIA 13 



Using ([35]) and (|33|) we conclude that the event 



B = ^sup\Y e , 2 (t) - e- x - t (y 2 + e a X e,2)\ < 2Ke a -y3j 

is such that P(-B) > 1 — 7/2, for all e > small enough. 

The proof will be complete once we show that f3 = l c with probability at least 
1 — 7. The latter is a consequence of the following chain of inequalities that hold 
for all e > small enough: 

P{^+ A p- < Q < P ({/3+ A /3_ < l e } n A n B) + P(A C ) + P(B C ) 

< P ({/?+ a /?_ < zj n A n b) + 7 

< P {{(3+ < /3_ A ZJ n A) + P ({/3_ < /3+ A /J n B) + 7 
= P{2 < 5/3} + P{2 < 2/3} + 7-7- 

□ 

Let us now analyze the evolution of the process Y e up to time f e A . We start 
with an application of Duhamel's principle: 



(36) Y eA (t) = e A +%!(0) + / e x +^H 1 (Y e (s),e)d S + ee x + t N+(t), 

Jo 

(37) y e , 2 (t) = e- A -*r e . 2 (0) + / e~ x -^H 2 (Y^ S ),6)d S + eN~(t), 

Jo 

where N^(t) are defined by 

iV+(t) = / e-^axiY^dWis), 
Jo 

(38) iV-(t) = / e - A -(*- s )a 2 (y e ( S ))d^( S ). 



Lemma 10. 

sup|y e . 2 (i)- e - A - t y 2 HO P (e^). 

t</ e 

Proof. Duhamel's princinple, Theorem [5J and the definition of f e imply that for 
some K > 0, 

|y e , 2 (t)-e- A -*itt| <e a |x e , 2 |+ / e -M*-«) (jfil^CaJI^^+ifa^da + eSrC*) 

Jo 

for any t G (0,f e ). The result follows since by Lemma[5]the r.h.s. is Op(e ap ) □ 

As a simple corollary of this lemma, the first statement in Theorem [5] follows: 
Corollary 11. As e -» 0, 

P{rf < f £ } -> 0. 

In particular, (|18p ZioZds irite. 
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Lemma 12. Let 

N+(t)= [ e- x - s d 1 (0,e- x - s y 2 )dW. 
Jo 

Then 

sup \N+(t) - N+{t)\ 0, e^O. 
t<f c 

Proof. BDG inequality implies that for some constants Gy^Cy. > 0, 

Esup \N+(t) -N+(t)\ 2 < dE r e e- 2A + s |^(y e>1 ( S ),y ei2 ( S )) - (0,e~ x - s y 2 )\ 2 d S 
t<f e Jo 

(39) < ^Esupla!^,!^),^^))-^^^-^ 8 ^)! 2 . 

t<f e 

From Lemma [TO] and the definition of f e , it follows that 

(40) sup |(y e ,i(*)»n,2(t)) - (0,e- A -*y 2 )| = Op(^). 
t<f e 



The desired convergence follows now from (1391) . (j40|) . and the boundedness and 
Lipschitzness of &\. □ 

We are now in position to give the first rough asymptotics for the time f c . From 
now on we restrict ourselves to the event {7™ > f e } since due to Corollary [Til its 
probability is arbitrarily high. 



Lemma 13. As e — > 0, 



Te > -— loge J> -> 0. 



Proof. Let u e be the solution to the following SDE: 

du e (t) = X+u e (t)dt + eai(Y e (t))dW(t), 
Ue(0) = e a Xe ,i- 

Let us take <5o £ (0,1) to be specified later and consider the following stopping time 

T e = m{{t:\u e (t)\=e a6 "}. 
Duhamel's principle for u e writes as 

u e (t) = e a e x + t Xt , 1 + ee x + t N+(t) 
= e a e x +%(t), 

with 

(41) Ut)=Xe,x + e 1 - a N+(t). 

Hence, the definition of r e implies e ado = e a e x+T " \rj e (r e )\, so that 

a 1 
r t = -- — (l-5 )loge- — log|»7 e (r e )|. 
A+ A+ 

Due to pX|) and Lemma [TO1 the distributions of log |?/ e (T e )| form a tight family. 
Therefore, 



(42) limP|r £ >-(l-^)— \oge}=0. 



NORMAL FORMS APPROACH TO DIFFUSION NEAR HYPERBOLIC EQUILIBRIA 15 



This fact allows us to use Lemma [9] to estimate Y e up to f e A r e . From (|36|) , the 
difference A e = Y e .i — u t is given by 



A e (i) = e A+t [ e- x + s H 1 (Y e (s),e)ds. 
Jo 



We can use ([41?]) to justify the application of Lemma [3] up to time f € Ar e . Then, we 
combine Theorem Lemma [9j and the definition of f e to see that 

sup e-^lH^nW.e)! sup (( e - A +*|n,i Wl) |n,i(*)l ' l^,a(*)|) + ^e 2 

i<T e Ar e Kf £ AT e 

= Op f e a+Q f) 



and 



„A + f e At 



These two estimates together with (j42|) imply 

sup |A e (t)| = P fe Q ^ +5 ^|loge|) 



{<f e Ar, 

On one hand, (|42]l implies 



P ^|f e > -— logej n{f e <r e }j ->0. 

On the other hand, if f e > r e then 

\Y eA (r e )\ = | e ^+Op( e ^ +A 'o)|loge|) , 

and 

\ye,x(%)\ < t ap - 

These relations contradict each other for sufficiently small e if we choose Sq < p. 
So, this choice of Sq guarantees that P {f e > t £ } — > implying the result. □ 



Proof of Lemma\^ Recall that we work on the high probability event {f e < T^}. 
Hence, for each e > 0, we have the identity 

e ap = e a e x + f '\T 1 ?\. 

Solving for f e and then plugging it back into Y e ^, we get 

(43) f e = ~(l-j>)loge-^-log|»/+| J 

A+ A + 

r £ , 1 (f e ) = e Qp sgn(, ? +). 

Using this information we are in position to get the asymptotic behavior of the 
random variables rjf 1 . First, from relation (|36p we get 

(44) nt = Xe,l + e~ a I ' e- x ^H 1 (Y e (s),e)ds + e 1 - a N+(T e ). 
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Using d43} in (J37J) we get 



V7 = \vt\ X - /X+ (y2 + e a Xe.2) + \vt\ X - /X+ / e x - s H 2 (Y e ( S ),e)ds 

Jo 

(45) +e 1 - a ^ 1 - p ^-^+N-{T e ). 

The main part of the proof is based on representations (l4"51) - (|4l))) . 
Lemma [T51 allows us to use the estimates established in Lemma [S] up to time f e 
In particular, now we can conclude that the family 



(46) [e- a sup \Y e>2 (t) - e- A -*«2 



£>0 



is slowly growing thus improving Lemma [TU1 

To obtain the desired convergence for 77+, we analyze the r.h.s. of (|44[) term by 
term. The covergence of the hrst term was one of our assumptions. For the second 
one, we need to estimate Hi(Y € , e). Using Lemma [§J the boundness of Y e> 2 and the 
definition of f e , we see that 

(47) sup e- A + t Tf 1 (t)|y 6 , a (t)| = P (e Q+Qp ). 

t<t e 

This estimate and Theorem [5] imply that 

e~ a e- x + s H 1 (Y e (s),e)ds < K^ a H e - A +'y e 2 1 («)|Y 6l2 («)|dfl + ^e 2 ~ a 
Jo Jo A + 

= Op(e Q f|loge|). 



Let us estimate the third term in (|44|) . We can use the last estimate along with (|44j) 
and Lemma [TUl to conclude that the distributions of positive part of AT log|r?^| 
form a tight family. Therefore, (1431) implies that 

t c — > oo, e —> (J. 
Combined with Ito isometry and Lemma [T2l this implies 

which completes the analysis of ry+ and, due to (|43|) . of f e . 

To obtain the convergence of rj~ , we study (|4"5)l . Combining (|4"6")l . the inequality 

|Y M (t)|Y e 2 2 (i) < 2|Y e>1 (t)| (|Y e , 2 (i) - e- x -%\ 2 + e-^yl) , 

and the definition of f e we see that for any q S (0, ap), 

sup e A -*|Y e>1 (t)|Y e 2 2 (*) = Op ( e «f+«-9 e A-T, + e a P ) _ 
t<f e 

Hence, as a consequence of Theorem [5] and (|4"5)l we have 

/ ' e x - s H 2 {Y t {s),e)ds = O p (( e «P-«+« e A-n + £ "p) | loge |) 
Jo 

= O p ^ e «(i-(i-f)>-M+)+(«p-<?) +e a^ | loge A _ 
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Combining this and Lemma [8] in (|45|) we obtain 



+ |A-/A 4 



+ 2/2 + Op (e a ) + Op( (>(l-(l-P)A-/A + )+(ap-<2) + £ ajA | log £ 



Ve = n 

+ Op ^ e l-«(l-P)A-/A+-<? X 

which finishes the proof of Lemma [6] by choosing q small enough. □ 

7. Proof of Lemma [7] 

Consider the solution to system (|13p - (fT4")) equipped with initial conditions (|20|) 
satisfying (|2Tj) . Let us restrict the analysis to the arbitrary high probability event 

{\V?\<K±}, 

for some constants K± > 0. 

Lemma 14. Let p G (0, 1) satisfy (| 15(1 . and let (t e ) e> o be a slowly growing family 
of stopping times. Consider t' e — t t A rj 7 , then for any 7 > 0, 

limP \ sup|y e2 (*)| < (K- + 7 ) e Q ( 1 -P) A -/ A + I =1. 
Proof. Let 7 > 0. We recall that N~ is defined in (|38]l and introduce the process 
(48) M e (t) = JV e ~(t) +e / e^-^-J^aCy^sJJda, 



where 'J 2 was introduced in Theorem [5j and the stopping time 

A = inf {i : |y«, a (t)| > (X- + 7 ) e «d-p) A -/ A + } . 

Using the fact that Y eA is bounded, it is easy to see that there is a constant K\_ 
independent of t, so that for any t < (3 e A t[, we have 

e -*-(.*-')\Y eA (a)\Y* 3 (a)da < # a _ £ 2«(i-p)A-A+ . 

This estimate, along with Duhamel's principle and Theorem [5] implies that for some 
constant C > and any t < /3 e A t' e , 

\Y t . 2 {t)\ < e a(1 - p)X - /X +\v7\ + K X f e- x -^\Y eA (s)\Y e 2 2 ( S )ds + e sup \M e (t)\ 

Jo t<ff e 

< e <l-p)\-/\+ K _ + C , £ 2a(l-p)A_/A + + £ gup | M£ ( t )|. 

Hence, using Lemma [8] to estimate M e , we obtain that 

P{/3 e < Q = P J sup |F £ , 2 (<)| > + 7 ) e «(i-p) A -M + I 

[t<£ e At' e J 

< P \ c e «(i-p)A-/A + + e i-a(i-p)A_/x+ sup | Me (^| > 7 I 

{ t<p r _ J 

converges to as e — > proving the lemma. □ 
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Lemma 15. Under the assumptions of lemma \T]\ for any p £ (0, j 2 -], 7 > 0, and 
C > 0, define p t — (— ploge + C) A rj 7 . Then, we have 

limp(sup|r el (0|e- A + t < (l+ 7 )e Qp l = 1. 
U< P « J 

Proof. Define the stopping time 

/3 e = inf{t: |y £il (i)|e- A+t >(l + 7 )e Qp }. 
As a consequence of Duhamel's principle and Theorem [5] we get the bound 

sup |F e ,i(t)|e- A +* <e ap +K x / e- x + s Y e 2 A (s)\Y e , 2 (s)\ds 

t</3 E Ap £ Jo 

+ e 2 A' 2 A; 1 +e5+(/3 e ). 

This estimate together with Lemma 1141 Lemma [5] and the dehntion of p t implies 
that for any small S > we can find a constant K > 0, so that with probability 
bigger than 1 — 5, the inequalities 

sup |F £ ,i(i)|e- A+t <c c * + iife 03H - at ( 1 - p J A -/ A +G8 6 Ap e ) + *"c 

t<P c Ap. 

< e ap {l + 2Kpe a( - 1 ~ p)x - /x +\\oge\ + Ke l ~ ap ), 
hold for all e > small enough. Hence, for any small enough e > 0, 

P{ft<p e } = pi sup |n.i(i)|e~ A+ * > (l + 7)e Q 4 

[t</3 e A Pe J 

< p{jrpe tt ( 1-p > A -/ A +|]oge| +Ke 1 - ap > 7} + 5, 
which implies the result. □ 

The following is an important consequence of Lemma 1141 
Corollary 16. With t € as in (|22[) it holds that 

mnPWy < tA = 0. 

e->0 

In particular, (|24p holds. 

From now on, we restrict our analysis to the high probability event {tY > r e }. 
Let 0+ = e- ap e- A + T «r £ ,i(T e ). Then, implies 

(49) r e = -— log e + — log — , 

A+ a + \y e 1 

and 

F £ , 1 (T e ) = 5sgn^+. 

Our analysis of these expressions will be based on the next formula which directly 
follows from Duhamel's principle: 

(50) 0+ =sgnr)+ + e- ap f ' e~ x + s H^Y^s), e)ds + e^iV+fc). 

Jo 

The main term in the r.h.s. of (|50"|) is sgnr;+. We need to estimate the other two 
terms. Lemma [5] implies that e p N^(T e ) converges to in probability as e — » 0. 
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Let us now estimate the integral term. Relations (|49|) and ([50]) imply that (r c ) e >o 
is slowly growing, and we can use Lemma fl4l to derive 

(51) S np\Y et2 (t)\ = P (e a ^ x -/ x +). 

t<T e 

We can now use Theorem [5] to conclude that 

e- ap sup |JTi (*;(<), e)| = Op( e <*(i-p)A-M+-°p + £ 2-«p) j 

t<T e 

and (fT5|) implies that the r.h.s. converges to 0. Therefore, 

e~ ap [ ' e- x+s Hi(Y e (s),e)ds A 0. 
Jo 

The above analysis of equation (|50l) implies that if we define 0^ = sgn rj^ , then 

(52) 0+ ^ 0+, 

which implies ([2"5"]) due to (|4"9"]) . It remains to prove ([21)]) . 
Duhamel's principle along with ([49]) yields 

(53) ^ ^ 

r e , 2 (r e ) = (^p) + e aX -' x ^: + J\- x -^-^H 2 {Y e {s),e)d S + eN:{r t ). 

In order to study the convergence of N~(r e ) we first give a preliminary result. 
Lemma 17. 

sup | Y M (i) - e Q P e A+t sgn 77+ 1 A 0, e -> 0. 

t<r £ 

Proof. The lemma follows from Duhamel's principle and Lemma [TBI □ 

The following result is essentially Lemma 8.9 from [2]. It holds true in our setting 
since its proof is based only on the conclusion of Lemma 1171 

Lemma 18. As e — > 0, 

N-(T e )^N, 
where N is the Gaussian random variable in (12311. 



We finish the proof of Lemma [71 Recall that the process M e was defined in ([48 
and introduce the stochastic processes 

(54) R e {t)= [ ' e- x -^-^H 2 (Y e (s))ds. 

Jo 

Note that ((S3) and (gSJ) imply 

Y e , 2 (re) = e~ x - T >Y e , 2 (0) + f ' e- x -^^H 2 {Y t {s),e)d S + eNZ(T t ) 



n 



= e -A_r. c a(l-p)X_/A + ^- + e M e (r e ) + R e (r e ) 
(55) = 7?- fi^H ' + e aX -' x + + eM e (r e ) + R e (r e ). 



Relations {H) and ([52]) imply 



(56) C(^) ^> (^) y 2 . 



Law ( |J7q 
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Lemma [18] and estimate (|5"Tj) imply 

(57) M e (r e ) ^N, e -»• 0. 

Equations (|55|) and (j57) describe the behavior of first two terms in ([55)1 and the 
proof of the lemma will be complete as soon as we show that 

(58) e-PR e {r e ) A 0, e -4 0. 

We can write the following rough estimate based on (|5Tj) and Theorem [5j 

(59) sup = P (e 2Q(1 - p)A - /A+ ). 

t<T £ 

This is not sufficient for our purposes. We shall need a more detailed analysis 
instead. First, note that 



sup \Y e , 2 (t) - eM e (t) - Re(t)\e x - 1 = £ <*U-pM-M + fa = P (e° 

t<T e 



,\-t _ £ q(i-p)a_/a + I — _ n T ,('f Q ( 1 "P) A -/ A +' 
Hence, for any 7 > there is a K 1 > such that the event 

D £ = (sup |F e , 2 (i) - eM e (t) - i? e (i)|e A -* < K 7 e a ^ x -/ X + 1 

has probability P(-D e ) > 1 — 7 for e > small enough. Moreover, using Theorem[5] 
we see that for some constant Kp > 0, 



\R e (t)\ <K P I e 

JQ 



A_(t-s) v 2 



YfJs)ds. 



Then, using the inequality (a — b) 2 < 2a 2 + 2b 2 and defining Kp~, = KpK~, we see 
that on D f for each t < r f , 



\Re(t)\ < K fi e~ x -* f (e x - s Y e , 2 (s)) 2 e- x - s ds 
Jo 



<2K p . 1 e- x - t / e~ x - s e 2a ^^ x -/ x +ds + 2K p I e- x - ( - t ~ s ^\eM e (s) + R e (s)\ 2 ds 
Jo Jo 

(60) 

Jo 

where M Cj00 = sup 4<Tt |M e (i)|, so that (according to Lemma [5]) M ej00 is slowly 
growing. Due to (|5"9")) we can find a constant K' > (independent of e > and 
t > 0) so that the event 



D' e =D e D I sup \R e (t)\ < ^ e «(i-f)A-/A+ j 



has probability P(D' e ) > 1 — 7 for all e > small enough. Hence, multiplying both 
sides of (|60l) by e A -*, we see that for some constant C 7 > and all t<r e , 

e x - t \R t (t)\l v ,<a(t)+G\e a( - 1 - p '> x -/ x + f e x - s \R e {s)\l v ,ds, 



where 

(61) a(t) = C 7 e 2ail - p)x - /x + + C 7 e 2 M e 2 0O e A -*. 



NORMAL FORMS APPROACH TO DIFFUSION NEAR HYPERBOLIC EQUILIBRIA 21 



Using Gronwall's lemma and (jBTj) we get 



< a(t) + ch 3a ^-^ x -/ x +te c ^' 



»(l-p)A_/A 



+ ^ e 2+a(l-p)A-/A +M 2 oote A_ t+ C T£ °< 1 -)-/^^ 
A 

Hence, 

l^liieWI < C^-^-'^e-^il + C7 7e Q ( 1 -rf A -/ A +ie c ^ 2a( " P,A - A+t ) 

+C y e a A£ ot) (l + ^L e a ^ x -/ x +te c ^ ail ~ P)X - /k+ ). 
A— 

Using (US]), we get that for any q > 0, 

l^|i? e (r e )| = Op (>(i- P )A_/A +e -A_Te +e 2 Me 2 i00 ) 

= Op ^ e Q A-/A + +Q(l-p)A_/A + + £ 2-<^ ^ 

so that (|55|) follows, and the proof is complete by choosing q small enough. 
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